Picture for Chengyu Wang

Chengyu Wang

Mock Worlds, Real Skills: Building Small Agentic Language Models with Synthetic Tasks, Simulated Environments, and Rubric-Based Rewards

Add code
Jan 30, 2026
Viaarxiv icon

VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

An Information-Theoretic Framework for Robust Large Language Model Editing

Add code
Dec 18, 2025
Figure 1 for An Information-Theoretic Framework for Robust Large Language Model Editing
Figure 2 for An Information-Theoretic Framework for Robust Large Language Model Editing
Figure 3 for An Information-Theoretic Framework for Robust Large Language Model Editing
Figure 4 for An Information-Theoretic Framework for Robust Large Language Model Editing
Viaarxiv icon

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Add code
Oct 10, 2025
Viaarxiv icon

Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing

Add code
May 29, 2025
Viaarxiv icon

EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models

Add code
May 27, 2025
Viaarxiv icon

UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models

Add code
May 18, 2025
Viaarxiv icon

BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering

Add code
May 17, 2025
Viaarxiv icon

Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations

Add code
May 16, 2025
Figure 1 for Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Figure 2 for Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Figure 3 for Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Figure 4 for Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Viaarxiv icon

DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models

Add code
Apr 21, 2025
Viaarxiv icon